Experimental Evaluation of a Caching Technique for ILP
نویسندگان
چکیده
Inductive Logic Programming (ILP) is a Machine Learning technique that has been quite successful in knowledge discovery for relational domains. ILP systems implemented in Prolog challenge the limits of Prolog systems due to heavy usage of resources such as database accesses and memory usage, and to very long execution times. The major reason to implement ILP systems in Prolog is that the inference mechanism implemented by the Prolog engine is fundamental to most ILP learning algorithms. ILP systems can therefore benefit from the extensive performance improvement work that has taken place for Prolog. On the other hand, ILP is a non-classical Prolog application because it uses large sets of ground facts and requires storing a large search tree. One major criticism of ILP systems is that they often have long running times. A technique that tries to tackle this problem is coverage caching [5]. Coverage caching stores previous results in order to avoid recomputation. Naturally, this technique uses the Prolog internal database to store results. The question is: does coverage caching successfully reduce the ILP systems running time? To obtain an answer to this question we evaluated the impact of the coverage caching technique using the April [1] ILP system with the YAP Prolog system. To understand the results obtained we profiled April’s execution and present initial results. The contribution of this paper is twofold: to an ILP researcher it provides an evaluation of the coverage caching technique implemented in Prolog using well known datasets; to a Prolog implementation researcher it shows the need of efficient internal database indexing mechanisms.
منابع مشابه
On the Implementation of an ILP System with Prolog
Inductive Logic Programming (ILP) systems is a set of Machine Learning techniques that have been quite successful in knowledge discovery in relational domains. These systems implemented in Prolog are among the most successfull ILP systems. They challenge the limits of Prolog systems due to heavy usage of resources, such as database accesses and memory usage, and very long execution times. In th...
متن کاملA New Goal programming approach for cross efficiency evaluation
Cross efficiency evaluation was developed as an extension of DEA. But the traditional DEA models usually have alternative optimal solutions and, as a result, cross efficiency scores may not be unique. It is recommended that without changing the DEA efficiency scores, the secondary goal should be introduced for optimization of the inputs/outputs weights. Several reports evaluated the perfo...
متن کاملبررسی تأثیر نرم کننده پرانرژی مایع یونی بر پایه ایمیدازولیوم بر خواص حرارتی نیتروسلولز
In this paper investigates an energetic imidazolium ionic liquid plasticizer (ILP) effect on the degradation kinetics of nitrocellulose, which is a important component of double based solid propellants. For better comparison and evaluation, diethyl phthalate (DEP) plasticizer, which has a structure similar to ILP, was also evaluated. Heat of combustion analysis was performed to evaluate the en...
متن کاملA Framework for Set-Oriented Computation in Inductive Logic Programming and Its Application in Generalizing Inverse Entailment
We propose a new approach to Inductive Logic Programming that systematically exploits caching and offers a number of advantages over current systems. It avoids redundant computation, is more amenable to the use of set-oriented generation and evaluation of hypotheses, and allows relational DBMS technology to be more easily applied to ILP systems. Further, our approach opens up new avenues such a...
متن کاملImprove Replica Placement in Content Distribution Networks with Hybrid Technique
The increased using of the Internet and its accelerated growth leads to reduced network bandwidth and the capacity of servers; therefore, the quality of Internet services is unacceptable for users while the efficient and effective delivery of content on the web has an important role to play in improving performance. Content distribution networks were introduced to address this issue. Replicatin...
متن کامل